Universal Option Models
Authors
Abstract
We consider the problem of learning models of options for real-time abstract planning, in the setting where reward functions can be specified at any time and their expected returns must be efficiently computed. We introduce a new model for an option that is independent of any reward function, called the universal option model (UOM). We prove that the UOM of an option can construct a traditional option model given a reward function, and also supports efficient computation of the option-conditional return. We extend the UOM to linear function approximation, and we show the UOM gives the TD solution of option returns and the value function of a policy over options. We provide a stochastic approximation algorithm for incrementally learning UOMs from data and prove its consistency. We demonstrate our method in two domains. The first domain is a real-time strategy game, where the controller must select the best game unit to accomplish a dynamically-specified task. The second domain is article recommendation, where each user query defines a new reward function and an article’s relevance is the expected return from following a policy that follows the citations between articles. Our experiments show that UOMs are substantially more efficient than previously known methods for evaluating option returns and policies over options.
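The reward-independence claim can be illustrated with a minimal tabular sketch. The abstract itself does not define the model, so the following rests on assumptions: the option is represented by a discounted state-occupancy matrix `U` and a discounted terminal-state matrix `Pterm`, both computed from a known transition matrix under the option's policy. The names `universal_option_model`, `option_return`, `P_pi`, and `beta` are hypothetical and chosen only for illustration.

```python
import numpy as np

def universal_option_model(P_pi, beta, gamma):
    """Assumed tabular form of a reward-independent option model.

    P_pi[s, s2]: probability of moving from s to s2 under the option's policy.
    beta[s2]:    probability that the option terminates on arriving in s2.
    gamma:       discount factor in [0, 1).
    Returns (U, Pterm):
      U[s, s2]     = expected discounted number of visits to s2 before termination,
      Pterm[s, s2] = expected discount accumulated when terminating in s2.
    """
    n = P_pi.shape[0]
    # Broadcasting scales column s2 by the probability of NOT terminating there.
    cont = gamma * P_pi * (1.0 - beta)
    U = np.linalg.inv(np.eye(n) - cont)
    # One more discounted step that ends in termination.
    Pterm = U @ (gamma * P_pi * beta)
    return U, Pterm

def option_return(U, r):
    """Option-conditional return for a per-state expected reward vector r."""
    return U @ r

# Three-state chain 0 -> 1 -> 2, with the option terminating on reaching state 2.
P_pi = np.array([[0.0, 1.0, 0.0],
                 [0.0, 0.0, 1.0],
                 [0.0, 0.0, 1.0]])
beta = np.array([0.0, 0.0, 1.0])
U, Pterm = universal_option_model(P_pi, beta, gamma=0.9)

# The same U serves any reward function specified later.
print(option_return(U, np.array([1.0, 2.0, 5.0])))
print(option_return(U, np.array([0.0, 1.0, 0.0])))
```

Under these assumptions, `U` is computed once per option, and each newly specified reward function costs only a matrix-vector product, which is the kind of efficiency the abstract describes.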
Similar resources
Option Pricing on Commodity Prices Using Jump Diffusion Models
In this paper, we aim to develop an option pricing model to reduce the risks associated with fluctuations in Ethiopian commodity prices. We used the daily closing prices of Unwashed Lekempti grade 5 (ULK5) coffee and Whitish Wollega Sesame Seed Grade 3 (WWSS3), obtained from the Ethiopia Commodity Exchange (ECX) market, to analyse the price fluctuations. The natures of log-returns of the prices exhibit a...
Numerical Methods of Option Pricing for Two Specific Models of Electricity Prices
In this work, two models are proposed for electricity prices as energy commodity prices which in addition to mean-reverting properties have jumps and spikes, due to non-storability of electricity. The models are simulated using an Euler scheme, and then the Monte-Carlo method is used to estimate the expectation of the discounted cash-flow under historical probability, which is considered as the...
How Does Pricing of Day-ahead Electricity Market Affect Put Option Pricing?
In this paper, impacts of day-ahead market pricing on behavior of producers and consumers in option and day-ahead markets and on option pricing are studied. To this end, two comprehensive equilibrium models for joint put option and day-ahead markets under pay-as-bid and uniform pricing in day-ahead market are presented, respectively. Interaction between put option and day-ahead markets, uncerta...
A New Stock Model for Option Pricing in Uncertain Environment
The option-pricing problem is always an important part in modern finance. Assuming that the stock diffusion is a constant, some literature has introduced many stock models and given corresponding option pricing formulas within the framework of the uncertainty theory. In this paper, we propose a new stock model with uncertain stock diffusion for uncertain markets. Some option pricing formulas on...
A Combined Method for Option Pricing Based on the Black-Scholes and Binomial Tree Models (Case Study: Iran Stock Exchange)
This paper suggests a combined option pricing model based on the Black-Scholes and binomial tree models. First, these two models are presented and analyzed. We then show that the Black-Scholes model is an appropriate option pricing model for stocks with low volatility, and that the binomial tree model is an appropriate option pricing model for stocks with high volatility. Suggested model is a composed model of...